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Abstract 



We give lower bounds on the growth rate of Dejean words, i.e. minimally repetitive words, over a fc- letter 
alphabet, for 5 < k < 10. Put together with the known upper bounds, we estimate these growth rates with 
the precision of 0.005. As an consequence, we establish the exponential growth of the number of Dejean 
\Q ■ words over a fc-letter alphabet, for 5 < k < 10. 

1. Introduction 

^"j ■ Let w — ai ■ ■ ■ a n be a word over an alphabet E. The number n is called the length of w and is denoted 

by \w\. The symbol ai of w is denoted by w[i]. A word Of • • • Oj, where 1 < i < j < n, is called a factor of w 
and is denoted by w[i : j]. For any i = 1, . . . , n the factor w[l : i] (w[i : n\) is called a prefix (a suffix) of w. 
A positive integer p is called a period of w if a,; = ai +p for each i = 1, . . . , n — p. If p is the minimal period 
of w, the ratio e(w) — n/p is called the exponent of w. Two words w' ,w" over £ are called isomorphic if 
\w'\ = and there exists a bijection er : S — > £ such that u>"[i] = er(it/[i]), i = 1, . . . , \w'\. By /C(w) 
we will denote the set of all words over £ which are isomorphic to the word w. We also denote by |^4| the 
number of elements of a finite set A. Let |S| = k. It is easy to note that |/C(w)| = fc! if w contains at least 
CO ■ k — 1 different symbols of S. 

I/"") | Let W be an arbitrary set of words. This set is called factorial if for any word w from W all factors of w 

are also contained in W . We denote by W(n) the subset of W consisting of all words of length n. If W is 
factorial then it is not difficult to show (see, e.g., 0,[l[) that there exists the limit limn^oo A(/|iy(n)| which 
is called the growth rate of words from W. For any words u, v we denote by W^ v \n) the set of all words 
from W(n) which contain v as a suffix, and by W^ u,v \n) the set of all words from W{n) which contain v as 
a suffix and u as a prefix. 

One can mean by a repetition any word of exponent greater than 1. The best known example of repetitions 
is a square; that is, a word of the form uu, where u is an arbitrary nonempty word. Avoiding ambiguitjO], 
by the period of the square uu we mean the length of u. In an analogous way, a cube is a word of the form 
uuu for a nonempty word u, and the period of this cube is also the length of u. A word is called square-free 
[cube-free) if it contains no squares (cubes) as factors. It is easy to see that there are no binary square- free 



words of length larger than 3. On the other hand, by the classical results of Thue [20Ll21|. there exist ternary 
square-free words of arbitrary length and binary cube-free words of arbitrary length. For ternary square-free 
words this result was strengthened by Dejean in Q. She found ternary words of arbitrary length which have 
no factors with exponents greater than 7/4. On the other hand, she showed that any long enough ternary 
word contains a factor with an exponent greater than or equal to 7/4. Thus, the number 7/4 is the minimal 
limit for exponents of avoidable factors which is universally called the repetition threshold in arbitrarily long 
ternary words. Dejean conjectured also that the repetition threshold in arbitrarily long words over a fc-letter 
alphabet is equal to 7/5 for k = 4 and fc/(fc — 1) for fc > 5. This conjecture is now proved for any fc through 
the work of several authors @, H, 0, !, El 0, EE [3 ■ 
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Denote the repetition threshold in arbitrarily long words over a fc-letter alphabet by ipk- In the paper we 
will call the words having no factors with exponents greater than ipk minimally repetitive words or Dejean 
words. By S^(n) we denote the number of all minimally repetitive words of length n over a fc-letter 
alphabet. Note that the set of all minimally repetitive words is obviously factorial. So for any fc there exists 
the growth rate 7^ = lim„_ i . 00 yj (n). 

The problem of estimating the number of repetition-free words has been investigated actively during the 
last decades (reviews of results on the estimations for the number of repetition-free words obtained before 
2008 can be found in 0, [H|). The most progress in this field has been made for the case of binary alphabet. 
In this case Dejean words reduce to overlap-free words which are also a classical object for combinatorial 
investigations. It is proved in 17| that the growth of the number of binary overlap-free words is polinomial. 
Actually, binary over lap- free words of each length are counted by a 2-regular function 0] ■ 

In jllfl we proposed a new approach for obtaining lower bounds on the number of repetition-free words. 
Using this approach, we obtained precise lower bounds for the growth rates of ternary square-free words, 
binary cube-free words, and ternary minimally repetitive words. This approach proved to be very effective. 
In particular, in [l9j Shur proposed an interesting modification of our approach which allows to compute 
more effectively lower bounds for the growth rates of words which contain no repetitions of exponent greater 
than or equal to a given bound if this bound is not less than 2. The direction of our further investigations 
in this field is testing the proposed approach for "extreme" cases when the prohibitions imposed on words 
are maximal possible for the existence of words of arbitrary length avoiding these prohibitions. These cases 
are obviously the most diffucult for obtaining lower bounds on the number of appropriate words. The case 
of minimally repetitive words is a natural example of such "extreme" cases. Moreover, the general case of 
minimally repetitive words over a fc-letter alphabet for k > 5 when tp^ = kj(k — 1) is the most interesting for 
us. So this paper is devoted to obtaining lower bounds on 7W for fc > 5 by using the proposed approach. 
Note that the method proposed in ll| is not directly applicable to resolving this problem because of the 
huge size of required computer computations. In this paper we propose an improvement of this method 
which requires significantly fewer computer computations. Using this improvement, we obtain lower bounds 
on 7W for 5 < fc < 10 which have the precision of 0.005. As an evident consequence of these results, we 
establish the exponential growth of the number of minimally repetitive words over a fc-letter alphabet for 
5 < fc < 10 (for fc = 3, 4 this fact was proved by Ochem in |l4|). 



2. Estimation for the number of minimally repetitive words 

2.1. General 

For obtaining a lower bound on 7'*' we will consider the alphabet = {ai, a 2 , . . . , a^} where fc > 5. 
We denote the set of all minimally repetitive words over by J- '. By a prohibited factor we mean a 
factor with an exponent greater than fc/(fc — 1). Let to be a natural number, to > fc, and w',w" be two 
words from T(m). We call the word w" a descendant of the word w' if w'[2 : to] = w"[l : m — 1] and 
w'w"[m] = w'[l]w" E J-(m+l). The word w' is called in this case an ancestor of the word w" . We introduce 
a notion of closed words in the following inductive way. A word w from J-(m) is called right closed (left 
closed) if and only if this word satisfies one of the two following conditions: 

a) Basis of induction, w has no descendants (ancestors); 

b) Inductive step. All descendants (ancestors) of w are right closed (left closed). 

A word is closed if it is either right closed or left closed. We denote by Pirn) the set of all words from T{rn) 
which are not closed. By C m we denote the set of all words over such that the length of these words is 
not less than to and all factors of length to in these words belong to JF{rn). We also denote by J- m the set 
of all minimally repetitive words from C m . Note that a word w is closed if and only if any word isomorphic 
to w is also closed. So we have the following obvious fact. 

Proposition 1. For any isomorphic words w',w" and any n > \w'\ the equality \Fm \n)\ = \J~m (n)\ 
holds. 
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A word will be called rarefied if the distance between any two different occurences of the same symbol in 
this word is not less than k — 1. 



Proposition 2. Any word from C m is rarefied. 

Proof. Let w be an arbitrary word from C m . Assume that w[i] = w[j] where j<i<j + (k — 2). Consider 
the factor f = w[j : i\. Since \f\=i — i + 1 < & — 1 < m, into the factor / is contained in some factor /' 
of length m. By the definiton of C m we have /' £ ^(m), so / £ T . On the other hand, / has the period 
l/l - 1, so 



l/l-l i-j ~k-2 k-l 
which contradicts the definiton of J-(m). □ 

A word w of length n > k — 1 over will be called trimmed if io[n — {k — 1) + j] = aj for j = 1, . . . , k — 1. 
We denote by P'(m) the set of all trimmed words from JF(m). Taking into account Proposition [21 it is not 
difficult to note that for any word from J-"(m) there exists a single word from F'(m) which is isomorphic 
to this word, and for any word from !F'(m) there exist exactly k\ different words from T{m) which are 
isomorphic to this word. Thus |.F(m)| = fc!|.F'(m)|. Let w',w" be two words from jF'(m). We call the word 
w" a quasi- descendant of the word w' if w" is isomorphic to some descendant of w' . The word w is called 
in this case a quasi- ancestor of the word w" . 

Let s = \J-(m)\ and s = \J r '(m)\. Without loss of generality we can assume that J-(m) = u>2, . . . , w$} 
where J-'(m) = {w±, W2, ■ ■ ■ , w s }. For any word w from J-(m) we will denote by i(w) the serial number of w 
in J-(m), i.e. i(w) = i if W = Wi for some i = 1, 2, . . . , s. We define a matrix A m = of size s x s in 
the following way: Sij — 1 if and only if ua is an ancestor of wf, otherwise Sij = 0. For any natural t by 
Am = i^if) we will denote the f-th power of the matrix A m , i.e. 

= A m x A m x . . . x Am . 
t 

Further we use the following evident fact. 

Proposition 3. For any i,j = l,2,...,s and any n > m the equality \Cm t ' W3 \n)\ — 5^ is valid. 

We also define a matrix A m = (&y) of size s x s in the following way: <5jj = 1 if and only if uii is 
a quasi-ancestor of uij] otherwise 5^ = 0. Note that A m is a nonnegative matrix, so, by the Perron- 
Frobenius theorem, for A m there exists some maximal in modulus eigenvalue r which is a nonnegative real 
number. Moreover, we can find some eigenvector x — (x\) . . . ; x g ) with nonnegative components which 
corresponds to r. Assume that r > 1 and all components of x are positive. Then we denote by 11 the ratio 
max i=1 a Xi/ min,-! gXj, and for n > m we define sffl (n) — 1 Xi ■ \J~m (ri)\. In an inductive way 
we estimate Sm (n + 1) by Sm\n). 

First we estimate \Tm\n + 1)| for each w £ P(m). It is obvious that 

+ 1)1 = \Q [w \n + 1)| - \H (w) (n + 1)|, (1) 

where Q^ w \n + 1) is the set of all words v from Cm\n + 1) such that v[l : n],v[n — m + 1 : n + 1] £ T, 
and %( w \n + 1) is the set of all words from Q^ w \n + 1) which contain some prohibited factor as a suffix. If 
w £ JF'{m) we denote by ir(w) the set of all quasi-ancestors of w. Taking into account Proposition [TJ it is 
easy to see that 

\g^(n+l)\= \^(n)\- (2) 
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Therefore, using that x is a eigenvector of A m for the eigenvalue r, we obtain 

E"=i &< ■ \gM(n + i)\ = E-=i(^ ■ E, e ^) l^(n)|) 

<5 s i \ / I^V)! \ 



/ $21 

$12 $22 



V Si 



>2s 



) 



(w 2 ) 



(n)\ 



\ \?£ s) (n)\ ) 



( \^\n)\ \ 



r ■ (xi;x 2 ; ...;x s ) 



(w 2 ) 



HI 



= r-SlH } (n) 



(3) 



We now estimate \HS w ^{n + 1)|. For any word v from H^ w \n + 1) we can find the minimal prohibited 
factor which is a suffix of v. We denote this factor by h(v) and the minimal period of this factor by 
X(v). Since after removing the last symbol from h(v) this factor can not be prohibited, we have actually 
\h(v)\ = [kX(v)/(k - 1)J + 1. Note that the value X(v) is not less than p = (m+ 1) - |_(m + l)/fcj. Thus 



W(»+l)|=£|7#°(n+l)l 

j>PO 



(4) 



where Wij"'{ri + 1) is the set of all words v from %^ w \n + 1) such that X(v) = j. 

2.2. Upper bound for \"Hj (n + 1)| 

To estimate \Hj (n + 1)|, let x{j) = Lj'/(^ — 1)J +1 an( l let t = j +x(i) + 1. Recall that for any v from 
"Hj (n + 1) the prohibited factor h(v) is a word from + x(j)) with the minimal period j. Moreover, 

this word doesn't contain shorter prohibited factors and contains the word w as a suffix. 

Let Xffi be the set of words v G C m (t) such that v[l : t-1] G F(t-l), v[3 : t] G F{t-2), v[t-j~x(j) + 1 : 
t — j] = v[t — x(j) + 1 :t] and it; is a suffix of v. Note that for every v G -^j t 5 v \t ~ J ~ x(j)] ^ V \P ~ x{j)]i 
otherwise v[l : t — 1] would have a forbidden factor. Suppose that n + 1 >t and let it G %^\n +1). Then 

u[r^ + 1 : n + 1] = u[n" + 1 : n - j + 1] (5) 



where n$ = n-|j/(fc-l)J and = n- [fcj/(fc-l)J . By definition of H^in+l), u[n+l-t+l : n+1] G X£ 
Moreover u[l : n + 1 — i + m] G -F(n + 1 — t + m). Thus we have 



(w) 

t ■ 



Proposition 4. 



Hf\n + 1)\< \J* v[1 ' m]) (n+l-t + m)\. 



Let be the multiset of all prefixes of size m in words of X^' (note that among words Uj ™' we can 
have identical words, i.e., the same word can be a prefix of different words of X^ and so can be counted 
several times in Uj™ ). Then Proposition 0] implies 



Uf\n + 1)\< £ |J*°(n + l-t + m) 
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For I = 1, . . . , s, denote by Q t (w) the number of occurrences of wi in the multiset U) w t > . Then 

|^ w) (n + l)|< J2 \^ ) (n+l-t + m)\ = j2cSn-\HT ) (n + l-t + m)\. (6) 

2.3. Weaker upper bound for \U { J w \n+ 1)| 

We can also obtain another estimation for \7i^\n + 1)| where w G P'{m). This estimation is more 

rough in comparison with (J6j) but requires much fewer computer computations. To estimate [Hj (n + 1)| 
by this way, we denote [j/{k — 1)J + 1 by an d assume that > k — 1 and n > j + to. Recall that 
for any v from H^fo + 1) we have relation (|5|). We consider separately the two following cases: < m 
and xij) > TO - 

Let < m- For any w from Hj (n + 1) denote by f'(y) the factor v[rt + 2 — j — to : n + 1 — j] of u. It 

follows from v G £ m that /'(«) G J-(m). Moreover, from ((5|) we obtain that f'(v) and w have the common 
suffix of length x(j)- Since w G P' (m) and x(j) > fc — 1, it implies that /' (u) G P'{m). Thus 

|W<™>(n + l)| = 2 \^u(n + l)\ 

where Wj (w) is the set of all words from J-"'(m) which have the common suffix of length x(J) with the word w, 

and i n + 1) is the set of all words v from Hj (n + 1) such that f'(v) = u. To estimate (n + 1)|, note 

that for any v fromlj™' 1 (n+1) we have v[l : n+1— j] G J^m ' (n+1— j) and u[n+2— j— m : ra+ 1] G ^(j+m). 
Hence, using Proposition [3J we obtain 

|2£?(n + 1)| < |j£> (n + 1 - i)| • + ™)l = I^C" + 1 " i)l ' 

Thus, in this case we get the estimation 

u£Wj{w) 

Let now > m. For any u from "Hj(n + 1) denote by f"(v) the factor + 1 : n" + to] of u. It 
follows from v G £ m that f"(v) G P(m). Thus in this case 

|*<"»(n + l)l = £ + 

where J^{n + 1) is the set of all words v from H < J w \n + 1) such that /"(w) = it. To estimate \ J^{n + 1)|, 
consider an arbitrary word v from ^7^™ (n + 1). Note that v is determined uniquely by the prefix v[l : n'j +m] 
which satisfies the following conditions: v[l : n" + to] G Fm\n" + to), + 1 : n + 1 — j] G £m (x(i))) 
and u[n + 2 — j — m : nfj +m] G £^' u ^(j + 2m — x(j))- Hence, using Proposition [3j we obtain 

\J^J(n + l)\ < \^Hn'j + m)\-\Ct w Hx(j))\-\^ u) (j + ^rn-x(M 

= \j&> « + m) I ■ ■ ft+rv^ ■ 

I m V j i\ l{u),l{w) l(w),l(u) 

Thus, in this case we get the estimation 

\76 w) (n + 1)1 < Y <5 ( f ( ^7" l) • 5 ( /+ m 7^ j)) • \J*p « + m) | . 

tt£^*(-m) 
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Taking into account Proposition [TJ we can rewrite this estimation in the form 



\H?\n+l)\< E \J*l\n'! 



(8) 



y v£fc(u) 



Note that, unlike estimation (|6]), estimations ([7]) and ([8]) can be computed in polynomial time. 



24. Estimation of \U {w) (n + 1) 

We fix numbers pi,P2 such that po < pi < P2 and P2 > 2fc — 3, and assume for convenience that 
n > kp2/(k — 1). We present sum (j4} in the form 

Pl P2 

\n^(n + i)\=J2\nl w \n + i)\+ E \nl w \n + i)\ + \ii^{n + i)\ 

j=Po J=Pi+l 



where "H^n + 1) = U J>P2 (n + 1). Thus £? =1 a^H^n + 1)| can be presented as 

Pl S P2 S S 

£^x i |^- ) (n+l)|+ £ ^x l |^ ) (n+l)l + E^^ (U ' l) ^ + 1 )l- 



(9) 



j=Po i=l 3=Pl+l i=l 

To estimate the first sum in (|9]), we use inequality 

Pl s Pi s 



j=Po «=1 



j'=Po i=l i=l 

E E^H-^^ 



fc-i 



fc-i 



1 + m ) 



1 + m) 



(10) 



where r)[{d) = $^* =1 ' Cj'j *j j +2 ( Wi ) ^ there is a j such that jttiJ + 2 = d, and = otherwise. 

To estimate the second sum in ([9]), we use inequalities and (j8]l . In particular, in the case of x(j) ^ m i 
using inequality Q and taking into account that u S Wj (iu) if and only if w G (it) , we obtain 

E>i«?*>(n+i)i < E E ^Syi^^+i-i)! 

i=l i=l ueWj (wi ) 



E i^ u) (^+i-i)i E 

ueP(m) \wieWjiu) 



1 WieWj(wi) 
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In the case of xO) > m i using inequality ([5]). we have 



5>|«<"*>(n + l)l < E E ^I^K' + m)! E 



9 tf) 

i,v 



i=l , 



^e/c(u) 



Ei^ !) K+hiE^( E € 

i=l \nG/C(to,) 



2=1 



g(i) _ c(x(j)-™) f(j+"»-x(j)) 

[kj/(k — 1)J — m for the case of > m, we conclude that 



where fffl = S)^{ m> ■ <5„ U I"' xyj " . Thus, defining = j - 1 for the case of X (j) < m and d(j) 



where 



Hence 



^2 Xi \nl Wi \n + l)\<^2m-\^ l Hn-d(j))\ 



1=1 



E Xi ' ' if x0')< m ; 

u)iGWj(toi) 



60) = 



E^ E if x(j)>m. 

I, *=1 \i>G/C(u>,) 



P2 S 



P2 S 



E E^i^i !) («+!)i< E E&tfH-^V-^))!- 

j=Pi + l i=l j= pi +i ; = i 

We define £,[{d) = if there exists some j such that = d, and £;(<f) = otherwise. Then 

P2 s ds s 

E E^-i^V-iODhE E^)-i^ !) (^-rf)i 

where d 2 = d(pi + 1), c?3 = d(p 2 )- 
Summing up (flU)) and (fTTj) . we get 



(11) 



E E^i^t" + !)i ^ E E w '( d ) • i-^°( n - d )i 

j—po i—1 d—a l—l 

where w/(d) = ^(d) + $(d), a = min(d 2 , + 2 - m - 1) and 6 = max(d 3 , 1 +2 - m - 1). 

We majorate this sum by some sum J2d=a P d ' ^m* ( n ~ d) m the following way. We compute consecutively 
coefficients pa of this sum for d = a, a + 1, . . . , b. For each d = a, a + 1, . . . ,b — 1 together with the number 
we compute also numbers Lo[(d + 1), . . . , u)' s {d + 1) such that 

E E w ^') • l^'V-i)! < E^+ !) • l^ W,) ( n -d - 1)| + EPi ■ S^{n-j). (12) 

j— a /— 1 j—a 
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For d = a we take p a = mini<z< s (a;/(a)/x/). Then 

s s 

5>,(a) • |J#«>(n - o)| - Pa ■ SW(n - a) + E*i • ^(r 



z=i z=i 

where z^ = u>i(a) — p a - %i, I — 1, . . . , s. Denote by z/ the vector (vi; . . . ; z/ s ) and consider the vector v' = A m i>. 
Let v' = {v' x \ . . , ; v' s ). It follows from <|TJ> and © that 

\J*?\n-a)\<\QW{n-a)\= E |j*0( n _ o -l)| 
for any £ = 1, . . . , s. Note also that v\ > for I = 1, . . . , s. Hence 

J2^-\^n l) (n-a)\ < ]TU- Yl \^(n-a-l)\ 

L=l 1=1 y ven(wi) 

s 

1=1 

Thus 

a+1 s s 

E 5>(j) • |J*°«>(n- j)| < p a ■ S$(n - a) +J» + 1) ■ l^'H" - a - 1)1 (13) 

j=a 1=1 1=1 

where u)[(a + 1) = a>;(a + 1) + f[. Assume now that for some d such that a < d < b we already computed 
the numbers p a , . . . , pd-i and u4(d), ■ • • , u' s (d). Then we take = mini<;< s (cj ; '((i)/x;), P = (tt^(d) — pd ■ 
x\ 1 . . . ,uj' s {d) — pa - x s ) 1 and v 1 — A to za We take also uj[(d-\- 1) = oj;(<i+ 1) + z^' where is the /-th component 
of the vector v' ', I — 1, . . . , s. Analogously to inequality (fT5|) . in this case we have the inequality 

s 

E ( w ,'(d) • - «oi +wi(d+ 1) • i4r !) (« - d - 1) 

< Pd ■ S<f>(n - d) +XX(d + 1) • l^fr - d - 1)|. 
1=1 

This inequality implies that inequality (|12[) holds for every d. For d = b we take p;, = maxi<;< s (o; i '(&)/2;i). 
Thus, 

b s b 

E E w < (<o • (" - d ) i ^ E w • } (» - d ) 



which implies 



P2 S f) 

E E + 1)1 < X> • ^(n - d). (14) 

j'=po »=1 d=a 



£5. Upper bound for \n^(n + 1) 

We estimate finally the sum J2i=i Xi\W- Wi '{n + 1)|. For this purpose we denote by ii(n + 1) the set 
ULi H {m) (n + 1) and by H'(n + 1) the set U-=i H [m) (n + 1). Note that the sets H( Wi \n + 1) are non- 
overlapping, so \U'(n + 1)| = Ei=i |^ (M,l) (« + 1)1- Thus 

S 

^ Xi\n im) (n + 1)\ < \n'(n + l)\ • max x t . (15) 

^ — ' i= 1 s 

1=1 
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Moreover, since by Proposition [5] any word from H(n + 1) is rarified and n + 1 > k — 1, for any word from 
T-l(n + 1) there exists a single word from T-L'(n + 1) which is isomorphic to this word, and for any word 
from Til (n + 1) there exist exactly fc! different words from %(n + 1) which are isomorphic to this word. So 
\H(n + l)\ = k\\U'(n + l)\. 

Let v be an arbitrary word from 7i{n + 1). Then for v we have 



V 71 



fc- i 



+ 1 : n + 1] 



fc - 1 



+ 1 : n' + 1] 



where n' = n — X(v). Thus the word v is determined uniquely by the number X(v) and the prefix v[l : 
n — [X(v)/(k — 1)J]. We denote this prefix by t(v). Further we use the following fact. 

Lemma 5. For any different v',v" G H(n + 1) the prefixes t(v'),t(v") are also different. 

Proof. Let t(u') = t(v") = u for some different v',v" G H(n + 1). Denote by Z the length of u. Note that 
v',v" G £ m , so u', v" and u are rarefied by Proposition [2] Thus without loss of generality we can assume 
that u is trimmed, i.e. 

ctj = u[l - (fc - = v'[l - (fc - 1) +j] = v"[l - (fc - 1) +j] (16) 

for j = 1, . . . , fc — 1. As we noted above, the equalities t(v') = t(v") and A(u') — X(v") imply v' = v". So 
A(u') ^ A(v"). Without loss of generality we assume that A(u') > A(u"). Since n — I = [X(v')/(k — 1)J = 
LA(u")/(fc — 1)J, we can assume moreover that X(v") < A(u') < A(u") + (fc — 1). Note also that the inequality 
[x(v')/(k - 1)J > 2 follows from A(u') > p 2 + 1 > 2fc - 2. So i = n - [X(v')/(k - 1)J < n - 2. Recall that 
we have also 

v'[l + 1 : n + 1] = «'[Z - A(u') + 1 : n - A(u') + 1] = up - A(w') + 1 : n - X(v') + 1], (17) 

v"[l + 1 : n+ 1] = v"[l - X(v") + 1 : n - X(v") + 1] = u[2 - A(«") + 1 : n - A(«") + 1]. (18) 

Suppose =u"[Z + l]. Then by equations (HH) and dHJ) we obtain up- X(v') + 1] = u[l- X(v") + 1\. 

Since 

(2 - X(v") + X(v') + 1) = X(v') - X(v") < fc - 2, 

this contradicts that u is rarefied. So v'[l + 1] ^ u"[Z + 1]. Since v' , v" are rarefied, it is easy to note 
from (fT6|) that + 1] and v"[l + 1] can be either a\ or au- So we have only two possible cases: v'[l + 1] = a\, 
v"[l + 1] = au or v'[l + 1] = dfc, u"[Z + 1] = a\. We consider these cases separately. 

Let u'p + l] = ai and «"[Z + 1] = a&. Then it is easy to note that the symbol w'[Z + 2] can be only afe. Thus, 
by equations (fT7|) and (|T5j) we obtain up — A(w') + 1] = ai, up — X(v') + 2] = at and up — X(v") + 1] = afc. 
So up - A(u') + 2] = up-A(u") + l]. Since 

(Z - X(v") + 1) - (Z - A(v') + 2) = A(u') - A(t>") - K k - 1 

and u is rarefied, the only case we have to consider is I — X(v") + 1 = I — A(t/) + 2, i.e. A(i/) — A(u") = 1 
(in this case u[Z — A(u') + 2] and up — A(u") + 1] are the same letter in u). Since v" is rarefied, v"[l + 2] 
can be either a\ or 02- If v"[l + 2] = ai, then by fig)) we obtain ai = up — X(v") + 2] = up — X(v') + 3]. 
Thus we have in this case that up — A(V) + 1] = u[Z — A(w') + 3] which contradicts that u is rarefied since 
2 < fc — 1. Let z/'p + 2] — G2- Then it is easy to note that the symbol v"[l + 3] can be only a\. Therefore, 
01 = up - X{v") + 3] = up - A(u') + 4] by Thus we have that up - A(u') + 1] = up - A(V) + 4] which 

contradicts again that u is rarefied. 

Let now v'[l + 1] = au and u"[Z + 1] = a\. Then it is easy to note that the symbol v"[l + 2] can be only au- 
Thus, by equations ([T7J) and JTHJ) we obtain up- A(u') + 1] = a fe , u[l-X(v")+l] = a x and u[Z-A(u")+2] = a k . 
Since u is rarefied, we have 

(Z - A(«") + 2) - (Z - A(w') + 1) = A(u') - AO") + 1 > fc - 1. 
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Thus X(v') — X(v") = k — 2 has to be valid in this case. Since v' is rarefied, we have also that v'[l + 2) can 
be either a\ or d2- If v'[l + 2] = oi, then u[l — X(v') +2] = ai by (JTTJ) . Since u[l — X(v") + 1] = ai and 

(I - A(t>") + 1) - (i - X(v') + 2) = X(v') - X(v") - l = k-3<k-l, 

this contradicts that u is rarefied. Let v'[l + 2] = a 2 . It is easy to note that in this case the symbol v'[l + 3] 
can be only a±. Therefore, u[l — X(v') +3] = a\ by (fTTJ). Taking into account that u[l — X(v") + 1] = ai and 
k > 5, we obtain again a contradiction with the fact that u is rarefied, so the lemma is proved. □ 

Note that for any word v £ H(n+ 1) we have t(v) £ T m and n— [n/k\ < \t(v)\ < n — L(j>2 + l)/(& — 1)J , 
i.e. t(v) £ Q(n + 1) = U^n-^/fc/^ 1 ^ ^ C?')- So from Lemma [S] we obtain that |Q(n + l)| > \U(n + l)\ = 
k\\H'(n + 1)|. Denote by Q'(n + 1) the set of all trimmed words from Q(n + 1). Since by Proposition [5] any 
word from Q(n + 1) is rarified and has the length greater than -pi > k — 1, for any word from Q(n + 1) there 
exists a single word from Q'(n + 1) which is isomorphic to this word, and for any word from Q'(n + 1) there 
exist exactly k\ different words from Q(n + 1) which are isomorphic to this word. So |Q(n+l)| = fc!|Q'(n + l)|. 
Thus \Q'(n + 1)| > \H'(n + 1)|. Note that actually Q'(n + 1) = U"i^/2j /( *~ 1)J U=i Fm^ti) and, since 
all sets J-"m (J) ar e non-overlapping, 

iq>+i)i= £ Ei-^°C7)i< E ^w/(.r 4 

^ — * * — ' * — 4 %— l,...,s 

Thus, taking into account (fl5|) . wc obtain 

s 

V Xi Wi \n + l)\ < |#'(n + l)| • max Xi < \Q'{n + l)\ ■ max Xi 

' J i=l,...,s i=l,...,s 

"-L^J LtJ (19) 

< (.max Xi ) J2 S ™ (?')/(. ™* Xj) = /i £ (n - d). 

1 — 1 S ^ ' 2— 1,...,S * ' 

' ' i=»-L*J rf =L^J 



2.6. Getting a lower bound for 

Summing up estimation (|19[) with relation (|14p. we conclude that 



s b LtJ 

J2^ {Wi) (ri + l)\<Y,P<i-S^(n-d) + ^ Yl S™(n-d). (20) 

1=1 d=a *=L3£J 

For the sake of convenience we denote by V(z) the polynomial X)d= a Pd ■ z d in & variable z. Suppose for 
some a > 1 we have 

S^{n)>a d -S^{n-d) (21) 
for each d = 1, 2, . . . , n — m. Then relation (i^Ol) implies that 



s L J 

5>|#^>(n + l)| < sW( n )£0 + /i <?W( n ) Yl 

^ 1 oo 

< sW(n) P(-)+/x J] - 



i 



S^(n) (r(-) + 
\ a 



a«(a - 1) 



10 



where q = 77^7" J — 1- Using this estimation and equalities ([I]) and (|3j), we obtain 

s s 

S^(n + 1) = ■ \S im) (n + x&^'Hn + 1)1 



1=1 



\ a a q {a — 1 

Therefore, if a satisfy the inequality 



r-V{-)- M >a, 
a oflya — 1) 

we obtain the inequality Sm\n + 1) > aSm^ (n), and thus Sm\n + 1) > a d ■ (n + 1 — d) holds for any 
d=l,2,...,n — m + 1. If inequalities (|2ip hold for some n', then inequalities (pH]) hold inductively in this 
case for every n > n'. Thus we have Sm (n) = f2(a"). Since, obviously, the order of growth of S^(n) is 
not less than Sm\n), we then conclude that S^(n) = Cl{a n ). Hence 7^ > a. 

Note that for obtaining the bound y( k ' > a we have to prove initially that inequalities (f!?Tj) holds for n'. 
For these purposes we compute the exact values of (n) for n < no by an enumeration of all Dejean's 
words of size at most no- The inequalities Sffl (n + 1) > aSm^ (n) for uq < n < kp^/ik — 1) could be verified 
in the same inductive way as described above with evident modifications following from the restriction 
n < kp2/{k — 1). 



3. Results 

Using the described method of estimating 7^, we obtained lower bounds on 7W for 5 < k < 10. The 
obtained bounds together with the parameters m, no, pi, P2 used in the computer computations of these 
bounds are given in the following table. In this table we give also the upper bounds on 7'*) we obtain with 
the method described in [l8| • For the anti-dictionary A, we take the set of all binary minimally forbidden 
words in the Pansiot's code (w.r.t. factor containment) of size at most q. 



k 


m 


s 


no 


pi 


P2 


lower bound on 7^ 


9 


\A\ 


upper bound on 7^ 


5 


50 


5287 


150 


183 


600 


1.153811 


158 


12783585 


1.157895 


6 


33 


1926 


100 


125 


500 


1.223437 


113 


3946990 


1.224695 


7 


28 


318 


100 


126 


600 


1.236409 


114 


2958045 


1.236899 


8 


18 


31 


100 


119 


600 


1.234725 


118 


1399465 


1.234843 


9 


20 


42 


100 


123 


600 


1.246659 


112 


287646 


1.246678 


10 


22 


55 


100 


122 


600 


1.239287 


115 


65346 


1.239308 



Comparing the obtained lower bounds with the upper bounds on 7^ presented in the table, one can 
conclude that we have estimated 7^) for 5 < k < 10 with the precision of 0.005. 



4. Conclusion 

In this paper we obtained lower bounds on 7^ for 5 < k < 10, but we believe that by the method 
proposed in the paper lower bounds on 7^ could be computed for any fixed k > 5 (provided that 7^ > 1). 
So we consider as an interesting problem for further investigations the question if the computations described 
in the paper can be generalized theoretically for obtaining theoretical lower bounds on 7'*' valid for any 
k > 5. 
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